Non-intrusive System Level Fault-Tolerance

نویسندگان

  • Kristina Lundqvist
  • Jayakanth Srinivasan
  • Sébastien Gorelov
چکیده

High-integrity embedded systems operate in multiple modes, in order to ensure system availability in the face of faults. Unanticipated state-dependent faults that remain in software after system design and development behave like hardware transient faults: they appear, do the damage and disappear. The conventional approach used for handling task overruns caused by transient faults is to use a single recovery task that implements minimal functionality. This approach provides limited availability and should be used as a last resort in order to keep the system online. Traditional fault detection approaches are often intrusive in that they consume processor resources in order to monitor system behavior. This paper presents a novel approach for fault-monitoring by leveraging the Ravenscar profile, model-checking and a system-on-chip implementation of both the kernel and an execution time monitor. System faulttolerance is provided through a hierarchical set of operational modes that are based on timing behavior violations of individual tasks within the application. The approach is illustrated through a simple case study of a generic navigation system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Non-Intrusive Fault Tolerant Framework For Mission Critical Real-Time Systems

The need for dependable real-time systems for embedded application is growing, and, at the same time, so does the amount of functionality required from these systems. As testing can only show the presence of errors, not their absence, higher levels of system dependability may be provided by the implementation of mechanisms that can protect the system from faults. We present a framework for the ...

متن کامل

Dynamic Replication: The Core of a Truly Non-Intrusive SRAM-based FPGA Structural Concurrent Test Methodology

The increasing use of reconfigurable computing platforms, employing SRAM-based FPGAs, opens exciting new possibilities since they enable the reutilization of the same hardware resources to implement speed-critical computational tasks, without interrupting system operation. Nevertheless, larger dies and the use of smaller submicron scales in the manufacturing of this new kind of FPGAs increase t...

متن کامل

Hierarchical Error Detection and Recovery in a Software Implemented Fault Tolerance (SIFT) Environment

A key issue in the design of reliable distributed systems is how to make the entities that provide the reliability properties of the system, themselves failure resilient. An application executing in such a system is dependent on these entities and hence, it is critical to protect not just the application, but also the components of the fault tolerance layer, through a variety of error detection...

متن کامل

A Concept for Hybrid Fault Injection in Distributed Systems

Fault tolerance is an essential part in the design of distributed computer systems [2]. Thus, testing algorithms for such systems often involves fault injection (FI) experiments to evaluate if the specified fault hypothesis holds for the implementation, i.e., faults can be seen as part of the valid input data of a fault-tolerant system. Basically, the different methods of FI can be grouped into...

متن کامل

Statistical Feature Extraction for Fault Locations in Nonintrusive Fault Detection of Low Voltage Distribution Systems

This paper proposes statistical feature extraction methods combined with artificial intelligence (AI) approaches for fault locations in non-intrusive single-line-to-ground fault (SLGF) detection of low voltage distribution systems. The input features of the AI algorithms are extracted using statistical moment transformation for reducing the dimensions of the power signature inputs measured by u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005